
A DESIGN METHOD FOR AN 
OPTIMAL ATTITUDE REGULATOR 
FOR A SPINNING SPACE STATION 

by Paul S. Kempfer 

Langley Research Center 
Langley Station, Hampton, Va. 

NATIONAL AERONAUTICS AND SPACE ADMINISTRATION . WASHINGTON, D. C. . FEBRUARY 1967 



TECH LIBRARY KAFB, NM 



TECH LIBRARY KAFB, NM 


DIBDbSb 


A DESIGN METHOD FOR AN OPTIMAL ATTITUDE REGULATOR 

FOR A SPINNING SPACE STATION 

By Paul S. Rempfer 

Langley Research Center 
Langley Station, Hampton, Va. 


NATIONAL AERONAUTICS AND SPACE ADMINISTRATION 


For sale by the Clearinghouse for Federal Scientific and Technical Information 
Springfield, Virginia 22151 — Price $2.00 
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FOR A SPINNING SPACE STATION 

By Paul S. Rempfer 
Langley Research Center 

SUMMARY 

A study was made to show how a method in optimal control could be used in the 
design of a linear-feedback attitude regulator for a spinning space station. The design 
method is for linear systems, and the regulator, operating cyclically, minimizes a final 
error and uses a fixed amount of control effort in each cycle of operation. 

The resulting optimal feedback gains for the optimal regulator are computed and 
presented. The gains are time-varying and complex. For this reason a suboptimal 
regulator with less complex time-varying gains is designed on the basis of the optimal 
system. In addition, for comparison purposes, a standard constant-gain regulator is 
designed. Digital simulations of all three regulators are presented. It is shown that the 
two time-varying-gain regulators require a significantly smaller peak control torque than 
the constant-gain regulator. 


INTRODUCTION 

In the future, it may be desirable to place a large manned space station in orbit 
about the earth in order to carry out various extended orbital missions. In order to pro- 
vide the occupants with an artificial gravity and to provide some attitude stabilization, 
the station may be spun about one of its principal axes. In reference 1, an optimal regu- 
lator control law for linear systems was derived. The optimal regulator operates cycli- 
cally and minimizes a final-error magnitude while using a fixed amount of control energy 
in each cycle of operation. The method is based on the optimal linear regulator theory 
of reference 2. The present paper shows how this method could be used in the design of 
an attitude regulator for a large spinning space station. 

SYMBOLS 

Measurements for the present investigation were taken in the U.S. Customary System 
of Units. Equivalent values are indicated parenthetically in the International System (SI). 



4x4 positive definite symmetric matrix determining relative weighting of 
state variables in terminal- error criterion function 

positive scalar design parameter in weighting matrix A 

4x4 positive definite symmetric matrix determining outer hyperellipsoid 

4x4 positive definite symmetric matrix determining inner hyperellipsoid 

fixed value of control effort 

unit vector directed along desired orientation of space- station spin axis 

4x4 matrix determined by dynamics of spinning space station 

4x1 vector-matrix determined by dynamics of spinning space station 

4x1 vector-matrix of feedback gains 

,h 4 gains in q, r, a, and 0, respectively 

transverse moment of inertia with Iy = Ig’ slu g" foot2 (kilogram-meter 2 ) 

moments of inertia about principal station X-, Y-, and Z-axes, respectively, 
slug- foot 2 (kilogram- meter 2 ) 

4x4 identity matrix 

unit vectors along principal X-, Y-, and Z-axes, respectively 
terminal- error criterion function 

independent constants of motion for free- spinning space station 
any positive integer 
element of matrix M 

4x4 symmetric matrix determining a constant of motion for free- spinning 
space station 



4x4 solution matrix defined by equation (17) 


4x1 adjoint variable vector- matrix 

angular velocities about principal X-, Y-, and Z-axes, respectively, 
radians/second 

positive constant spin rate of space station about X-axis, radians/second 
Laplace transform variable 
time, seconds 

fixed operating time for each cycle, seconds 

control torque applied about Y-axis, foot-pounds (meter -newtons) 
normalized control input defined in equation (lb), radians/ second^ 
constant of motion for free- spinning space station 
principal space- station axes 

4x4 solution matrix for equations of motion for optimally controlled system, 
defined by equation (22) 

4x1 state-variable vector-matrix defined in equation (lb) 
initial state of spinning space station 

angles defining angular misalinement of space station, radians 
positive scalar design parameter arising from control effort constraint 
dummy scalar variable of integration 
dummy scalar variable, T - t 


transition matrix for uncontrolled system 



O total inertial angular velocity of space station, radians/ second 

co frequency defined in equation (lb), radians/second 

— derivative with respect to time 

dt 

Above a variable, a tilde denotes a 4 x 4 matrix, a bar denotes a 4 x 1 vector- 
matrix, and an arrow denotes a vector. Double subscripts on scalars denote elements 
of a matrix. A prime denotes the transpose of a matrix, and a minus-one superscript 
denotes the inverse of a matrix. Superscript o denotes optimal system, superscript s 
denotes suboptimal system, and superscript c denotes constant-gain system. Sub- 
script max denotes a maximum value. The symbol 4 means "is by definition," and 
the symbol £ means "is identically for all time." 

PRELIMINARY REMARKS 

A possible configuration for a spinning space station is shown in figure 1, where 
X, Y, and Z denote the principal station axes. The station is assumed to be spinning 
about the X-axis with a constant rate p Q ; hence, X is referred to as the spin axis. 

The rates about the remaining two axes, Y and Z, are denoted q and r, respec- 
tively. The orientation in inertial space of the unit vector e is that desired for the 
spin axis, and a and p measure the misalinement of the station spin axis from e". 

The angle a lies in the iTZ-plane and the angle p lies in the XY-plane. 

The spinning space station represents a neutrally stable system. If it receives an 
impulsive disturbing torque, its spin axis becomes misalined and moves so as to sweep 
out a cone which, in general, is not centered about e\ In this study, the station is 
assumed to be continually subject to small disturbances which eventually result in an 
undesirably large misalinement. The object of the regulator is to correct this misaline- 
ment in an efficient manner. 

It is desired that the regulator perform its job with the least amount of effort. For 
this reason, as long as the misalinement is small the regulator takes no action. Only 
when the misalinement reaches an unacceptable level does the regulator operate. At that 
time, the regulator acts to reduce both the rates, q and r, and the angles, a and p. 
The regulator operates for a prescribed amount of time and then shuts down until the 
persistent disturbances again cause an unacceptable misalinement. 

A control torque about the Y-axis of the space station supplies the regulator 
"muscle." Any torque-producing devices capable of producing continuously varying torque 
magnitudes, such as control moment gyros or reaction wheels, could be used. Ordinarily, 
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in the design of a regulator, -a constant-gain linear-feedback law would be specified and 
standard Laplace transform techniques used to find the gains. In this study, however, a 
theory in optimal control is used to provide time-varying gains in order to reduce the 
peak magnitude of the torques to a value lower than that required by a constant-gain sys- 
tem. This reduction would ease the demands on the torque-producing device and con- 
ceivably could reduce its weight and power requirement. 

The optimal regulator and its behavior are discussed herein. Its response is com- 
pared with that of a simplified suboptimal regulator and with that of a constant-gain sys- 
tem. All the systems considered are linear, and no control saturation is considered. 

ANALYSIS 


Design of Optimal Regulator 

The equations of motion for the spinning station are derived in appendix A by the 
method used in reference 3. Since the optimal regulator is most simply studied in 
vector-matrix notation, the equations of motion are presented in that form as 


where 


dx 

dt 


Fx + gu 


(la) 


x A 


q 


0 

-Cl) 

0 

0 

r 

FA 

Cl) 

0 

0 

0 

a 
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0 

0 

p o 

0 


0 

-1 

- p o 

0 


gA 


t Y 
uA — — 
- I 


J > 



(lb) 


In equation (lb) Ty denotes the control torque about the Y-axis, I x denotes the 
moment of the inertia about the X-axis, and I denotes the moment of inertia about both 
the Y- and Z-axes. The values for the parameters used in this study are as follows: 

I x , slug-ft 2 (kg- m^) 1.5 X 10 7 (2.0337 X 10 7 ) 

I, slug-ft 2 (kg- m2) 1.0 X 10 7 (1.3558 X 10 7 ) 

p Q , rad/sec 0.314 
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The theory used to design the regulator is presented in reference 1. In the optimal 
theory the activation criterion for the regulator is dependent not only on the misaline- 
ment angles, but on the angular rates as well. Specifically, .the activation criterion is a 
positive definite quadratic function of all the components of the state vector. Thus, the 
regulator takes action whenever the state has reached some four-dimensional ellipsoid 
in state space and operates to drive the state toward the origin. As a result, the state is 
kept within this hyperellipsoid. Each time the regulator is activated it remains in opera- 
tion for a fixed preset amount of time and then shuts down; thus, there is no need for an 
inner dead band to shut the regulator off. The equation for the hyperellipsoid is given by 
the optimal theory; and, although the regulator shutdown is governed by time, the optimal 
theory also gives the equation for a second hyperellipsoid which is inside the first and 
to which the state is driven on each operation. The size and shape of both the inner and 
outer hyperellipsoids are discussed subsequently. 

Each time the regulator acts, the control torque is governed by a linear-feedback 
control law which is optimal in that it minimizes a terminal- error function 

J = ix'(T)Ax(T) (2) 

where A = A' > 0 (i.e., J is positive for any nonzero x(T)) and where T is the fixed 
operating time, while using a given amount of control effort 

E = K T u 2 (t)dt (3) 

2 J 0 

Although the constraint on control effort does not imply a constraint on peak control 
magnitude, it indirectly causes such a constraint since the control is oscillatory and 
periodic. 

The first step in applying the design technique is to choose the A weighting matrix 
for the terminal error J. Next the optimal feedback control law is found as a function 
of certain design parameters available to the designer. The third step is to solve for the 
equations for the inner and outer hyperellipsoids. These hyperellipsoids are dependent 
on the same parameters as the control law. Finally the designer chooses the design 
parameters so that the inner and outer hyperellipsoids satisfy the performance require- 
ments of the regulator. These steps are now followed for the space- station problem. 

Choice of weighting mat rix A .- It can be shown that since the station is neutrally 
stable there is a matrix M = M such that, with no control or disturbance acting, the 
motion of the station satisfies the condition 

V A | x'(t)Mx(t) b Constant (4) 

” Ci 
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Because it satisfies this condition, the quadratic form V is called a constant of motion 
for the uncontrolled system. If this quadratic form is positive definite, it makes an ideal 
choice for the terminal- error function J, inasmuch as, when J is minimized and the 
regulator is shut down, except for small disturbances, J remains at its minimized value. 
The matrix M is then considered as a choice for A. 

The matrix M is easily found by taking the derivative of V and utilizing the 
equation for the uncontrolled system 

f=FX (5) 

with the result 

— V = J x'(t) (f'M + M0x(t) s 0 (6) 

dt >4 


The fact that FM + MF is symmetric and equation (6) is true for all values of x(t) 
implies that 

F M + MF = 0 (7) 

This equation is solved in appendix B with the result 


M = 


M 


11 

0 

0 

-1 


M 


11 


1 

0 


0 

1 

1.5p c 

0 


-1 

0 

0 

1.5p r 


( 8 ) 


where the element is an arbitrary constant. If the quadratic form V is written 

out, the result is 


V = 




1.5p 0 (a 2 + 13 2 ) + 2 (or - /3q) 


( 9 ) 


It should be noted that, since V is a constant of the motion of the uncontrolled system 
for any value of M^, the functions 

3 1 4 q 2 + r 2 (10) 


J 2 4 1.5p o (o! 2 + /3 2 ) + 2(ar- 0q) 


(ID 


are independently constants of the uncontrolled- system motion. This fact is used 
subsequently. 
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Before M may be used as A in the error function it must be tested for positive 
definiteness. With a useful theorem in reference 4 it is found that, for all positive a, 
where 

a 4 3M n P Q - 2 > 0 (12) 

M is positive definite. The parameter a is introduced here in order to simplify 
results presented subsequently. Since M is positive definite, it may now be used as the 
A weighting matrix. From equation (2) the terminal- error function may then be written 
(for any a > 0) as 


J 




+ 2 (or - /3q) 


(13) 


From equation (13) it can be seen that the parameter a weights the relative importance 
of the rates over the angles in the error function. The parameter a is used subse- 
quently as a design parameter. 


Determination o f the optimal control law.- First, the adjoint vector p, which has 
the same dimension as x (i.e., four), is introduced. The adjoint equations are then 
written as 


= -Fp 
dt 

P(T) = Ax(T)| 


(14) 


The theory presented in reference 1 indicates that the optimal control is of the form 


u°(t) = - ^ g'p(t) 


(15) 


where A is another positive design parameter which arises from specifying the amount 
of control effort. Substituting the optimal control into equation (la) results in 


dx 

dt 


Fx 


gg P 


x(0) = x 


(16) 


By solving equation (16) and the adjoint equations together, the optimal control may be 
found as a function of time alone. However, the optimal control as a function of time 
alone is an optimal open-loop control and not the optimal closed-loop control law desired. 
In order to obtain the desired feedback law, p is determined in terms of x by intro- 
ducing two solution matrices. 
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r 


The first solution matrix is for the adjoint equation (14) relating p(t) to x(T) and 
is denoted P(T - t). This matrix is 4 x 4 and each column satisfies the adjoint equation 
so that 


dP 

dt 


(T - t) = -F P(T - t) 


P(0) = A 


(17) 


From the terminal conditions on p and the equation linearity, the vector p may now 
be written as 

P(t) = P(T - t)x(T) (18) 


Substituting the vector p 


into equation (16) results in 


dx 

dt 


= Fx - i gg P(T - t)x(T) 


x(0) = x Q 


(19) 


This is an equation for the optimally controlled system in terms of its initial and final 
states. 

Let 4>(T - t) denote the transition matrix for the uncontrolled system so that 


^r(T - t) = f£(t - t)[ 

dt 

4>(0) = I 


( 20 ) 


With 1>(T - t), the solution to equation (19), written in terms of the final state as the sum 
of its uncontrolled and controlled response, is 


i pT-t „ 

x(t) = $(T - t)x(T) + i \ $(T - t - u)gg P(o)x(T)da (21) 

x J 0 

The x(T) is then taken outside the integral and factored out of the two terms to give 


x(t) = 


1 r T - t ~ 

4>(T - t) + — \ <f>(T - t - a)gg P(a)da 

X Jo 


x(T) 


( 22 ) 


By denoting the quantity within the brackets as X(T - t), the second solution matrix, 
equation (22) is rewritten 

x(t) = X(T - t)x(T) (23) 

It is noted that although X(T - t) does not satisfy the requirements for being the general 
transition matrix for the optimally controlled system equations, it is shown in reference 2 
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to be nonsingular. Thus equation (23) may be solved for x(T) as 


x(T) X" (T - t)x(t) 


( 24 ) 


and x(T) may then be substituted into equation (18) to give p as a function of x as 
desired. The resulting equation is 


p(t) = P(T - t)X _1 (T - t)x(t) 


(25) 


When p(t) is used in the optimal open-loop control, the optimal closed-loop control law 
is found to be 


u°(t) = - i g P(T - t)X -1 (T - t)x(t) 

A 


(26) 


From this law the vector of time-varying gains may be written as those elements multi- 
plying the elements of x(t). This vector is denoted h (t) and is written in row- vector 
form as 


h°'(t) = - ig’P(T - t)X -1 (T - t) 
A 


(27) 


so that the optimal feedback control law is 

u°(t) = hj°(t)q + h 2 °(t)r + hg 0 ^)^ + hj°(t)j3 (28) 

It now remains to explicitly solve for the solution matrices of the space station. By 

letting r » T - t and taking the Laplace transform of equation (17) with respect to 

d d 

r and by noting that — *= - the resulting equation is 

sP(s) - A = F P(s) (29) 

where s denotes the Laplace variable. This equation is then solved for P(s) to give 

P(s) = (si - f') _1 A (30) 

Since P(T - t) is not used alone, but only as P (T - t)g, and since g has only one non- 
zero component, the matrix multiplication is carried out before inverting the Laplace 
transform and thus the inversion is greatly simplified. The result is 


P (T - t)g = 

Jp o 


a cos 0.5p Q (T - t) + 2 cos p Q (T - t) 
a sin 0.5p o (T - t) - 2 sin P Q (T - t) 
-3p o sin p o (T - t) 

-3p o cos p q (T - t) 


( 31 ) 


where a can be defined as in equation (12). 
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With P(T - t)g known, the expression for X(T - t) can now be transformed with 
respect to r with the result 

X(s) = (si + f)" 1 + i(sl + F) _1 gg P(s) (32) 

where (si + f) is seen from equation (20) to be <f>(s). ^Again, note that ^ 

In order to avoid the laborious Laplace inversion and subsequent matrix inversion, rather 
than solving for X(T - t) analytically, the differential equation 


^(T - t) = FX(T - t) - ± gg P(T - t) | 
dt X 


X(0) = I 


(33) 


was integrated and the matrix inverse taken on a digital computer to obtain X'^T - t). 
With the digital solution, the time-varying gains could be found by using specified values 
for the design parameters a and X and the operating time T. 

Equations for inner and outer hy perellipsoids. - As shown in reference 1, the equa- 
tion for the inner hyperellipsoid is 

x'(T)Dx(T) = E (34) 

/W ^ ! 

where E is the magnitude of the control effort and D = D >0 is a 4 X 4 matrix found 
by carrying out the integration 


Since the elements of 
operating time 


~ _ T ~ 

D = — s \ P (r)gg P(r)di 


(35) 


2\* 

P (r)g are periodic, the integration is simplified by choosing the 


T = l = 40Z 

0.5p n 


(l = 1, 2, 3, . . .) (36) 


The result is 


D = 


36X^p 


2r> 2 


2? + 4 
0 
0 

-®Pr» 


a^ + 4 
6Po 


0 - 6 P 0 

6 Pq 0 

9p o 2 0 

0 9 Pq 2 


(37) 
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The equation for the inner hyperellipsoid is then written as 


E = 


T 

a 2 + 4 

6* 2 P 0 

[ 6p o 


(q2 + r 2 ) + 1.5p o (o!2 + /3 2 ) + 2 (or - /3q) 


( 38 ) 


The quadratic form on the right side of this equation is a constant of motion of the uncon- 
trolled system since it, like J, is simply a linear combination of Jj and j£. Thus, 
when the state is deposited on the inner hyperellipsoid, except for the disturbances, it 
stays there. 

As shown in reference 1, the equation for the outer hyperellipsoid is 

x'(0)Bx(0) = E (39) 

where B = B >0 and when B is a 4 X 4 matrix defined by the relation 

B = X” 1 (T)DX -1 (T) (40) 

It can be seen that both hyperellipsoids depend on the control effort E and that the D 
and B matrices depend upon the design parameters a and A and the operating 
time T. Hence, the hyperellipsoids may be shaped and sized with these parameters. 

Choice of the design p arameters.- The first parameter chosen was the operating 
time T. In this study, T was chosen independent of hyper ellipsoid considerations to 
be 240 seconds, which was considered a reasonable operating time since nearly the same 
space station had been satisfactorily controlled in this time by a bang-bang controller in 
reference 5. The remaining parameters were used to adjust the hyperellipsoids. First, 
an approximate allowable angular misalinement of 0.1 radian was chosen. Then it was 
necessary to decide how much this misalinement should be reduced each time the regula- 
tor was activated. A reduction factor of 10 was chosen so that the terminal misalinement 
would be about 0.01 radian. Finally, the allowable magnitude of the terminal rates was 
chosen by considering reference 5 and finding that a ratio of terminal angle to terminal 
rate of 3.75 seconds was acceptable. Thus the approximate terminal rates would be 
0.01 rad/3.75 sec = 0.00266 rad/sec. With all these requirements, the design parameters 
were easily found. 

Since an analytic expression for the inner hyperellipsoid (eq. (38)) was available, 
this hyperellipsoid was considered first. An angle a at the inner hyperellipsoid of less 
than about 0.01 radian was desired. With equation (38), the point on the inner hyperellips- 
oid at which a is a maximum could be found with the theory of constrained extremals. 
Then the parameters could be set so that a would satisfy the requirement exactly. 

Rather than following this procedure herein, however, the point a = 0.01, /3 = q = r = 0.0 

was chosen to lie on the inner hyperellipsoid for the sake of simplicity. Clearly this 
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point would not be the point of maximum angular misalinement but it was believed to be 
acceptable for the present example. Similarly, for simplicity, q = 0.00266 rad/sec was 
chosen to lie on the inner hyperellipsoid when a = j3 = r = 0. When these two sets of 
values were substituted into the equation for the inner hyperellipsoid, the two resulting 
equations were 

(240) (0.01) 2 = E (41) 

4A 2 


(240)(0.00266) 2 (a 2 + 4) _ E 
36p 0 2 x 2 


(42) 


By substituting the value of p Q , the following solutions were obtained: 

a = 2.9 (43) 


= 0.0775 
\/E 


(44) 


The outer hyper ellipsoid condition was then met by choosing E. Since no analytic 
expression for the B matrix was available, E was chosen by trial and error on a digi- 
tal computer. First, a small value of E was guessed. Then A was determined from 
equation (44) and was used in equation (33) to solve for X(T). By substituting X(T) into 
equation (40), the matrix B was computed. This matrix determined the outer hyper- 
ellipsoid. Again, for simplicity, rather than using the theory of constrained extremals, 
the condition that a= 0.1, /3 = q = r = 0.0 lie on this hyperellipsoid was then checked. 

Since a small value of E had been chosen, the hyperellipsoid was too small and fell 
within the checkpoint. The value of E was then successively increased until the check 
condition was met. This procedure was quite simple and required only four iterations. 
The result was 

E = 8.13 X 10"® (45) 

Thus, all the parameters were chosen and they completely fixed the two hyperellipsoids 
and the optimal feedback control law. The characteristics of the hyperellipsoids and the 
optimal gains are now discussed. 
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Study of Optimal Regulator 


The matrix B was computed to be 


B = 


7.34 x 10" 3 

-1.80 x 10“ 7 

-1.43 x 10" 7 

-1.73 x 10" 3 

-1.80 x 10" 7 

7.30 x 10" 3 

1.73 x 10“ 3 

4.10 x 10" 8 

-1.43 x 10" 7 

1.72 x 10" 3 

8.12 x 10“ 4 

3.25 x 10" 8 

-1.73 x 10" 3 

4.10 x 10" 8 

3.25 x 10" 8 

8.14 x 10“ 4 


( 46 ) 


When the quadratic form x Bx is written out, with terms of the order 10 or less neg- 
lected, an approximate but very close equation for the outer hyperellipsoid may be written 
as 


0.00173 


4.23 (q 2 + r 2 ) + 1.5p o (o! 2 + 0 2 ) + 2(cKr - /3q) 


= E 


(47) 


Like J and the inner-hyperellipsoid quadratic form, this expression is also a constant 
of the motion of the uncontrolled system since it is simply a linear combination of 
and J 2 » Although it is difficult to visualize either hyperellipsoid because they both lie 
in four-dimensional space, some information may be gained by looking at certain cross 
sections. (See fig. 2.) 

From either of the two hyperellipsoid equations, it can be seen that a cross section 
in or parallel to the aft- or qr-plane is a circle. When either both the rates or both the 
angles are zero, the circle has its largest radius and encircles the origin as shown in 
figures 2(a) and 2(b). The three points used to choose the design parameters are shown 
in these figures. From the equations, it may further be seen that cross sections in or 
parallel to the ra- or q/3-plane are ellipses which are tilted because of the cross-product 
terms. When both /3 and q or both r and a are zero, the ellipses are at their 
largest and are centered about zero as shown in figures 2(c) and 2(d). In these figures, 
it is seen that the approximate angular misalinement requirements are exceeded because 
of the tilting of the ellipses. Further study of the hyperellipsoid equations shows that the 
maximum violations of the requirements occur in these cross sections. For the present 
problem the violations were considered acceptable. It should be pointed out that, if they 
had not been acceptable, more conservative approximate errors could have been chosen 
and the design process repeated once or twice with very little additional work. Use of 
the theory of constrained extremals previously mentioned could also have been made but 
was considered unnecessary for this study. 
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The time- varying optimal gains are presented in figure 3. The gains generally 
increase with time and there is clearly a greater dependence on q and /3 than on r 
and a. 


Design of Suboptimal Regulator 

In applying the optimal regulator, it is necessary to monitor the function 
0. 00173(4. 23J i + J 2 ) where Jj and J 2 are the constants of the uncontrolled motion, 
which are defined in equations (10) and (11). When this monitored function reaches the 
value E = 8.13 x 10 "^ the state would be on the outer hypersurface, since equation (47) 
for the outer hyperellipsoid is satisfied, and the control would be turned on. The control 
law then uses stored values for the time-varying gains and the whole state vector is fed 
back. This process is complex since it involves storage and many multipliers, and there- 
fore a simplified suboptimal regulator was considered. 

Since the gains on a and r for the optimal control were small, they were taken 
as zero for the suboptimal control. Thus, the dependence on a and r was eliminated. 
Next, it can be seen in figure 3 that the q gain, although negative, is nearly proportional 
to the /3 gain for the entire operation. In the suboptimal control, therefore, the constant 
of proportionality was estimated and used to give 

h x s = -4.23h 4 s (48) 

Then the time variation lay all in one gain h 4 s . In order to eliminate the necessity of 
storing this gain, the following simple time function approximating it was constructed: 

h 4 s = 0.0037 [exp(0.0085t) - 0.0077t)J (49) 

The nonzero suboptimal gains resulting from the approximations are compared with the 
optimal gains in figure 4. Thus, the suboptimal control has been simplified to 

u s = h 4 s (/3 - 4.23q) (50) 

Since the dependence of the control on a and r has been eliminated, it is desir- 

able for Jj and J 2 to be determined without measuring a and r. The need for 
measuring a and r would then be eliminated completely. A possible method for such 
a determination of Jj and J 2 is presented in appendix C, although it was not actually 
simulated in this study. In all simulations the suboptimal control system was initiated 
on the Outer hyperellipsoid with the same initial conditions as the optimal system. The 
behavior of the two systems is compared in the section "Results and Discussion." 
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Constant- Gain Control 


In order to compare the optimal and suboptimal regulators with a standard constant- 
gain regulator which performs the same function, a constant-gain regulator was designed 
by the use of only q and /3 feedback as is required for the suboptimal control. It was 
believed that the ratio of 0 gain to q gain used in the suboptimal control law was prob- 
ably a good one and so this ratio was used in the constant-gain system 

hj c = -4.23h 4 c (51) 

The constant gain h^ c was then found by trying values for h 4 c , digitally computing the 
system response, and then comparing the resulting terminal motion with the terminal 
motion of the suboptimal system. When the terminal motions appeared equivalent, that 
value which had been used was the gain. The value in the present study was 

h 4 c = 0.01 (52) 

The constant- gain feedback control was then 

u c = 0.01(/3 - 4.23q) (53) 

In the next section the performance of this system is compared with that of the other two 
regulators. 


RESULTS AND DISCUSSION 
Optimal Regulator 

Figures 5 to 7 show the performance of the optimal regulator operating from vari- 
ous points on the outer hyperellipsoid. These points are representative of the initial con- 
ditions the regulator might encounter. It is important to notice the constant amplitude of 
the periodic control variable evident for all three initial conditions. This constant ampli- 
tude is a general characteristic of the optimal regulator. Because of this characteristic, 
the maximum magnitude of control torque which the regulator requires at any time may 
be estimated. The maximum magnitude occurs when the regulator operates from initial 
conditions in the q/3-plane where the straight line of the equation 

/3 - 4.23q = Constant (54) 

just touches the ellipse. The points satisfying this requirement (one of which is the ini- 
tial condition of fig. 5) are shown in figure 2(d). The estimated maximum magnitude of 
control torque required is found to be 5200 ft-lbf (7050 m-N). 
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The constant- amplitude characteristic of the optimal control is responsible for the 
linear decay envelop of the rates and angles. For the initial conditions in figures 5 to 7, 
the envelop indicated a time to damp to hal* amplitude of 130 seconds. 

Suboptimal Regulator 

Figures 8 to 10 show the performance of the suboptimal regulator operating from 
the same initial conditions as the optimal regulator in figures 5 to 7. The time histories 
appear to be the same as for the optimal regulator during about the first 180 seconds. 

This similarity exists because the suboptimal gains approximate the optimal gains almost 
exactly for this time period. As the approximation becomes less exact, the suboptimal 
control drops slightly in amplitude. The result is that the angles and rates left after the 
suboptimal regulator shuts down are slightly larger than those left by the optimal regula- 
tor. The difference, however, is very small. The time to damp to half amplitude is the 
same as for the optimal regulator — that is, 130 seconds; however, because the sub- 
optimal control amplitude drops off toward the end of each operation, the suboptimal con- 
trol effort is less than the optimal control effort. For the simulations in this study the 
value of E used by the suboptimal regulator is from 1 percent to 7 percent lower than 
that used by the optimal regulator. The dropping off of the control amplitude does not, 
however, affect the magnitude of the required control torque, which is estimated at 
5200 ft-lbf (7050 m-N), the same as for the optimal system. 

Constant- Gain Regulator 

Figures 11 to 13 show the performance of the constant-gain regulator operating 
from the same initial conditions as the optimal and suboptimal systems. The behavior 
of this regulator is the standard constant-gain behavior. The control amplitude drops off 
as the angles and rates decay and thus an exponential envelop for the variables is formed. 
It is important to note that, although the terminal rates and angles are comparable to 
those of the two systems discussed previously, the time to damp to half amplitude is much 
shorter - 66 seconds. The price that the constant-gain system pays for this time reduc- 
tion is more control effort. The value of E used by the constant- gain regulator is from 
42 percent to 44 percent higher than that used by the optimal regulator. More important, 
the maximum control-torque magnitude required by the constant-gain regulator is 
13 820 ft-lbf (18 737 m-N). (See fig. 11.) This value is 2.7 times the maximum control- 
torque magnitude required by the optimal or suboptimal regulators. Since only terminal 
error is of interest, this result indicates that the optimal and suboptimal regulators make 
much more effective use of a given magnitude of available control torque than does the 
constant-gain regulator. 
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CONCLUDING REMARKS 


An optimal linear-feedback regulator has been designed for the attitude control of 
a spinning space station. The optimal' regulator employs time-varying gains to make con- 
tinued use of its available peak control torque. In contrast, a constant-gain system uses 
its available peak control torque only in the initial phases of its control operation. Thus, 
the magnitude of the peak control torque that the optimal regulator requires is signifi- 
cantly lower than that required by the constant-gain system. This reduction in torque- 
magnitude requirement is obtained, however, at the expense of system complexity. For 
this reason, a suboptimal system, which is much simpler than the optimal system but 
which has near- optimal performance, has been designed. The suboptimal system is more 
complex than the constant-gain system, but this complexity may be warranted in a space 
station where available control torque is at a premium. 

Langley Research Center, 

National Aeronautics and Space Administration, 

Langley Station, Hampton, Va., November 8, 1966, 

125-19-04-01-23. 
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APPENDIX A 


VEHICLE DYNAMICS 

The rate equations of motion for the spinning vehicle are simply a special case of 
Euler’s dynamical equations (see ref. 6) with the following assumptions employed: 

(1) Symmetric inertia distribution, that is, Iy = Iz = I 

(2) No component of torques about the X or Z axes 

(3) The space- station moment of inertia about the spin axis satisfies the inequality 
I X >I. 

With these assumptions, Euler's dynamical equations are written as 



(Al) 

I f - (I - I X )0 r - t y 

(A2) 

I f-( I x‘ I ) pq '° 

(A3) 

From equation (Al) the spin rate is constant and is denoted p Q . Equations (A2) and (A3) 
may then be rewritten 

t— : ? 

(A4) 

£-**-0 

(A5) 

where 


“ - (t - *>. 

(A6) 

The angle equations are written with the assumption that e is fixed in inertial 
space. With respect to the body axes, e then satisfies the equation 

|-e + fixe = 0 
dt 

(A7) 

where 


S2 = P q T + qj + rk 

(A8) 
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APPENDIX A 


and i , j , and k are unit vectors along the X, Y, and Z axes, respectively. From 
figure 1 it may be seen that in the body- axis coordinates the vector e is 

e" = cos a cos /3 i + cos a sin /3 j - sin ok (A9) 



and that the products a /3 qo:, and r/3 are small and may be neglected. These 

assumptions have been shown to be good for vehicles in motion with angular misaline- 
ments of less than 15° (see ref. 7). The resulting equations are 

af-p 0 ' 3 = -‘i (A11) 

gf + P Q “ = -r (A12) 

Equations (A4), (A5), (All), and (A12) compose the set of equations used in this study. 


20 



APPENDIX B 


SOLUTION FOR MATRIX M 


The equation which must be solved is written 

FM + MF = 0 

where M = M . Since M is symmetric, equation (Bl) may be rewritten 

(mf)' + MF = 0 

In order to use this equation, M is set up as 


and multiplied by F to give 


2 P o M 12 " M 13 


ip M,, - M 14 
2 r o 11 14 


■Po M 14 


J ivJ-i o 

O lo 


2 P o M 22 M 23 


2 P o M 12- M 24 


o 24 


p o M 23 


2 p o M 23 M 33 


2 P o M 13 M 34 


o 34 H o33 


2 P o M 24 ” M 34 


2 P o M 14 " M 44 


o 1 44 f o34 


Then MF is transposed and added to itself term by term to arrive at equation (B2). 

In equation (B2) there are 16 algebraic equations, not all independent, which are solved to 
give M. The result is 


" M 33/ 1,5p o 


l 33/ 1,5p o 


[ 33A 5p o 


- M 33A- 5p o 


APPENDIX B 


where both and Mgg are arbitrary constants. Notice that M may be rewritten 


as 


M = M 


33 


M ll/ M 33 

0 

0 

-i/i.SPo 


M ll/ M 33 V 1 - 5 ^ 


l/i.5p o 

0 


1 

0 


J / l * 5p o 

0 

0 

1 


(B6) 


which shows M to be arbitrary in the "magnitude" Mgg. The reason is that equa- 
tion (Bl) is homogeneous and thus the magnitude does not reflect any system character- 
istic. In this study, Mgg was chosen to be 1.5p Q in order to avoid the fractions in 
M. Thus, the result is 


M 


11 


M = 


0 

0 

-1 


M 


11 


1 

0 


0 

1 

1.5p c 

0 


-1 

0 

0 

1.5p r 


(B7) 


22 



I 


APPENDIX C 

METHOD FOR DETERMINING Jj AND J 2 

Since, for the suboptimal system, the dependence of the control on a and r has 
been eliminated, it is desired that Jj and J 2 be determined without measurement of 
a and r. The need for measuring a and r would then be eliminated completely. 
Such a determination of Jj and J 2 is possible, but it requires that the disturbance 
effects be small over any 40- second beat period of the space- station motion. With this 
requirement, the equations of motion for the uncontrolled system are 


$ - -°- 5 V 

(Cl) 

ar 0 - 5p o q 

(C2) 


(C3) 

m =- r - p o“ 

(C4) 


From equation (Cl) it is evident that, when q is a maximum, r is zero so 

T _ n 2 

J 1 Vax 

If equation (11) is rewritten as 


Jo = 


(l.5p o O!+r) 2 + (l.5p 0 /3- q) 2 - Jj 


2 1-5P 0 

then, with the equations of the uncontrolled motion, it may be found that 


(C5) 


(C6) 


^(i.5p 0 ^- q)=-p 0 ( 1 .5p 0 «+ r ) 


(C7) 


so that the second term within the brackets in equation (C6) is a maximum when the first 
term within the brackets is zero. The resulting equation is 


J 2 = 




(C8) 


The function which must be monitored to determine whether the state has reached the 
outer hyperellipsoid may then be written in terms of q^,,,. and (l.5p R - qL ov , 

lUciX \ O /ITlclX 

Thus the need for a and r sensors in the suboptimal system is eliminated. 
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Figure 1.- Spinning space station. 
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A Specified points used to choose 
design parameters 
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Figure 2.- Cross sections of the inner and outer hyperellipsoids. 
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□ Initial conditions giving maximum 
control torque 
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(d) Cross section in the qp-plane. r = 0; a = 0. 
Figure 2.- Concluded. 
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Figure 4.- Nonzero suboptimal gains compared with the optimal gains. 
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Figure 5.- Behavior of optimal regulator operating from the initial condition q = 0.033, r = 0, a = 0, p = 0. 
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Figure 6.- Behavior of optimal regulator operating from the initial condition p = 0.1, q = 0, r = 0, a = 0. 
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Figure 7.- Behavior of optimal regulator operating from the initial condition r = 0.033, a = -0.141, q = 0, p = 0. 


32 



rad /sec 


v.v-v. 




- 05 l 



Figure 8.- Behavior of suboptimal regulator operating from the initial condition q - 0.033, r = 0, a = 0, p = 0. 
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Figure 9.- Behavior of suboptimal regulator operating from the initial condition p = 0.1, q = 0, r = 0, a = 0. 
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Figure 11.- Behavior of constant-gain regulator operating from the initial condition q = 0.033, r = 0, a = 0, 3 = 0. 
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Figure 13.- Behavior of the constant-gain regulator operating from the initial condition r = 0.033, a - -0.141, q = 0, 13 = 0. 
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